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They are not capable 
to ground a canonicity 
of universal consistency. 

— Alexandra Deligiorgi 
(HAIAEIA, 1998) 



Abstract 



We explore how different proof orderings induce different notions of satura- 
tion. We relate completion, paramodulation, saturation, redundancy elimination, 
and rewrite system reduction to proof orderings. 



1 Introduction 

We show how to define the canonical basis of an abstract deductive system 
in three distinct ways: (1) Formulae appearing in minimal proofs; (2) non- 
redundant lemmata; (3) minimal trivial theorems. Well-founded orderings of proofs 
(Bachmair a nd Dershowitz, 1 9941 are used to distinguish between cheap "direct" 
proofs, those that are of a computational flavor (e.g. rewrite proofs), and expensive 
"indirect" proofs, those that require search to find. This approach suggests generaliza- 
tions of the concepts of "redundancy" and "saturation", as elaborated by Nieuwenhuis 
and Rubio in 12001 1. Saturated, for us, means that all cheap proofs are supported. By 
considering different orderings on proofs, one gets different kinds of saturated sets. 

This work continues our development of an abstract theory of "canonical infer- 
ence", initiated in [Dershowitz and Kirchner, 2003b I. Although we will use ground 
equations as an illustrative example, the framework applies equally well in the first- 
order setting, whether equational or clausal. Though our motivation is primarily aes- 
thetic; our expectation is that practical applications will follow. 

* Supported in part by the Israel Science Foundation. 
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Proofs and other details omitted here will be included in 
IDershowitz and Kirchn er, 2003a| . Our rewriting terminology accords with 
pDershowitz and Plaisted, 200 1| . 



2 Proof Systems 

Let A be the set of all formulae (ground equations and disequations, in our examples) 
over some fixed vocabulary. Let P be the set of all (ground equational) proofs. We are 
given two functions: F : P ^ 2* gives the assumptions in a proof, and A : P ^ A 
gives its conclusion. Both are extended to sets of proofs in the usual fashion. (We 
assume for simplicity that proofs use only a finite number of assumptions.) 

The framework proposed here is predicated on two well-founded partial orderings 
over P; a proof ordering >; and a subproof relation >. They are related by a mono- 
tonicity requirement given below Q. 

We will use the term presentation to mean a set of formulae, and justification to 
mean a set of proofs. We reserve the term theory for deductively closed presentations. 
Let A* denote the theory of presentation A, that is, the set of conclusions of all proofs 
with assumptions A: 

A* := Ar-M = {Ap:pGP, rp = A} (1) 

We assume the following three standard properties of Tarskian consequence relations: 

A* C {A\JB)* (2) 
A <Z A* (3) 
A** = A* (4) 

Thus, -* is a closure operation. We say that presentation A is a basis for theory B if 
A* = B. Presentations A and B are equivalents their theories are identical: A* = B*. 

As a very simple running example, let the vocabulary consist of the constant and 
unary symbol s. Abbreviate tally terms s'O as numeral i. The set A consists of all 
unordered equations i — j (so symmetry is built into the structure of proofs). We 
postpone dealing with disequations for the time being. An equational inference system 
for this vocabulary might consist of the following inference rules: 



° Z 



I, 



0-0 ^ z = J 



i — k si = sj 

where the proof tree branches (of T) are unordered. To accommodate (|2j, we also need 
projection: 

a c 



c 

For example, if ^ = {4 = 2, 4 = 0}, then A* = {i = j : i = j (mod 2)}. 
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Consider the proof schemata: 



□ 

= 

1 = 1 



4 = 
4 = 



4 = 2 



3 = 



I- J -1=1 



« = J 

Let's use proof terms for proofs, denoting the above two trees by S^Z and 
S^T{T[I{A, 0), /(4, 2)), S'S'(V,_j_2=o)), respectively. With a recursive path order- 
ing to order proofs, precedence Z<S'<r<J<P<0<l<2<--, and 
multiset "status" for /, minimal proofs of the theorems in A* must take one of these 
two forms, or the form of one of their subproofs. 

We call a proof trivial when it proves only itself and has no subproofs other than 
itself, that is, if = {Ap} and p ^ q ^ p = q. We denote by a such a trivial proof 
of a G A and by A the set of trivial proofs of each a ^ A. We assume that proofs 
use their assumptions, that subproofs don't use non-existent assumptions, and — most 
significantly — that proof orderings are monotonic with respect to subproofs. Specifi- 
cally, for all proofs p, q, r and formulae a: 



a e Tp 
p\>q 

p\> q > r A Aq = Ar 



p>a 
Tp D Tq 

3v ef.p > V \> r A Ap ■ 



Av 



(5) 
(6) 

(7) 



We make no other assumptions regarding proofs or their structure. 

Every formula a admits a trivial proof a by ( I3I5> . Let = {q - p ^ q} denote 
the subproofs of p, and likewise SP = UpgpSp. This way, (jsj can be abbreviated 
Fp C Sp. 

It may be convenient to think of a proof-tree "leaf" as a subproof with only itself 
as a subproof; other subproofs are the "internal nodes". There are two kinds of leaves: 
trivial proofs a (such as inferences I), and vacuous proofs a with F a = and A a = a 
(such as Z). By well-foundedness of [>, there are no infinite "paths" in proof trees. 

Postulate Q states that > (restricted to proofs with the same conclusion) and [> 
commute (i.e. o o > C > o[>), from which it follows that their union, the partial 
ordering (o U >)*, is also well-founded. 



3 Canonical Systems 

Denote the set of proofs using assumptions A by: 

UA := {peF: TpCA} 



3 



and define the minimal proofs in a set of proofs as: 

fiP := {peP: -Bq G P. Aq = Ap, q < p} 

On account of well-foundedness, minimal proofs always exist. 

Proposition 1 For all presentations A, B: 

THA C a (8) 
Y.^lIiA C iiUA (9) 

nrnA = wa (lo) 
nA = ns ^ a = b (11) 

And for all justifications P: P 'ZliT P. 

We say that presentation A is reduced when ^4 = F pliA. Our main definition is: 

Definition 1 (Canonical Presentation) The cwomcdl^v&sesAaiKm contains those for- 
mulae that appear as assumptions of minimal proofs: 

A^ := T^nA* 

So, we say that A is canonical ifA = AK 

Proof orderings are lifted to sets of proofs, as follows: 

Definition 2 Justification Q is better than justification P if: 

P^Q ■= VpeP.3qeQ.Aq = ApAp>q 

It is much better if: 

P^Q := "ip G P.3q G Q. Aq = ApAp> q 

Justifications are similar if: 

P c^Q ■= P^Q^P 

These three relations are compatible: ZIodCd, □o~C3, etc. 
The following statements can be shown to hold: 

Proposition 2 For all justifications P, Q: 

P ^ liP (12) 

P □ Q ^ tiP^ tiQ (13) 

PZiQ <^ fJ-P^fiQ (14) 

P~g ^ ^P = nQ (15) 
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Proposition 3 The relation □ is a quasi-ordering on proofs and a partial ordering of 
minimal proofs. 

This quasi-ordering on proofs is lifted to sets of formulae as follows: 

Definition 3 Presentation B is said to be simpler than an equivalent presentation A 
when B provides better proofs than does A: A B := A* = B* A HA □ Hi?. 
Presentations are similar if:A^B := HA ~ HB. 

These relations are also compatible. 

Proposition 4 For all presentations A, B: 

UA □ U{AUB) (16) 

A^B ^ pUA = /lUB (17) 

B C AaUA^UB AkB (18) 

AQB AA* =B* A>zB (19) 

Proposition 5 The relation ^ is a quasi-ordering and ~ is its associated equivalence 
relation. 

The function J is "canonical" with respect to equivalence of presentations. That 
is: A'i* ^ A*; A* = B* <^ A^ = B'i; and A" « = A*. This justifies the terminology of 
Definition[2 

Lemma 1 A>:AK 



4 Saturated Systems 

By a "normal-form proof", we will mean a proof in ^11 A*. On account of Q, all 
subproofs of normal-form proofs are also in normal form. We propose the following 
definitions: 

Definition 4 (Saturation) A presentation A is saturated if it supports all possible nor- 
mal form proofs: T\A 3 ^WA* . A presentation A is complete if every theorem has a 
normal form proof: A* = A (JIA n pMA*). 

A presentation is complete if it is saturated, but for the converse, we need a further 
hypothesis: minimal proofs are unique if for all theorems c € HA there is exactly one 
minimal proof in pHA* with conclusion c. 

Proposition 6 If minimal proofs are unique, then a presentation is saturated iff it is 
complete. 

For example, suppose all rewrite (valley) proofs are minimal but incomparable. 
Then any Church-Rosser system is complete, since every identity has a rewrite prof, 
but only the full deductive closure is saturated. 
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Theorem 1 A presentation A is saturated iff it contains its own canonical presenta- 
tion: A 13 AK In particular, A^ is saturated. Moreover, A^ is the smallest saturated 
set: no equivalent proper subset of A^ is saturated; if A is saturated, then every equiv- 
alent superset also is. 

Corollary 1 Presentation A is saturated iff A* « A. 

Proof. It is always the case that A ^ A* ^ A'. If A is saturated, then ADA* and, 
therefore. A* 'f^ A^ A. For the other direction, suppose p G ^liA* . Since A is 
similar, there must be a proof q E HA C HA*, such that q < p. But q ft p, so p E HA. 
It follows that fiUA* C UA, and A is saturated. 

Lemma 2 Similar presentations are either both saturated or neither is; similar pre- 
sentations are either both complete or neither is. 

Proof. The first claim follows directly from the previous result. For the second, one 
can verify that A B implies: 

B* ^ A* ^ A{UAnnUA*) = A {^iUA r\ fiUA*) 

= AinTiBDnnB*) = A(^Bn^lnB*) 

Formulae that can be removed from a presentation — without making proofs 
worse — are "redundant": 

Definition 5 (Redundancy) A set R offormulce is (globally) redundant with respect 
to a presentation A when: AiJ R'^ A\R. The set of all (locally) redundant /ormMte 
of a given presentation A will be denoted pA: 

pA := {r E A: A>zA\{r}) 

A presentation A is irredundant if pA = 0. 

It can be shown that A is reduced iff it is irredundant. 

Lemma 3 The canonical presentation is reduced: pA^ = 0. 

Proposition 7 The following facts hold for all presentations A: 

A « A\pA (20) 

A^ = A*\ pA* (21) 

A^ = A{pUA*r\A*) (22) 

= ^lUA* n T* (23) 

It is thanks to well-foundedness of > that the set of all locally redundant formulae 
in pA is globally redundant (Ea.l20>. The alternate definition of the canonical set i22\ 
is made possible by the properties of subproofs. 

Theorem 2 A presentation is canonical iff it is saturated and reduced. 
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Proof. One direction follows immediately from Theorem ^ and Lemma |3] For the 
other direction, let A be saturated and reduced. We aim to show that A ^ AK By 
Lemma[2 A ^ A' and the two presentations are equivalent. If A is saturated, then by 
Theorem[T] A D AK By (|19}, for any r e A \ A >; A» ^ A \ {r}. But pA = 0, 
since A is reduced, so it cannot be that r e A. In other words, A\J^ = 0, and A is 
canonical. 

Returning to our simple example, we can add three inference rules for disequalities: 



i ^ k 



i ^ i 
j = k 



■ j=k 



With them, one can infer, for example, 7^ from 1 7^ 1. If F is smaller than 
other proof combinators, and / nodes are incomparable, then the canonical basis of 
any inconsistent set \s {i ^ j : i,j e N}. All positive equations are redundant. 



5 Variations 

Consider the above inference rules for ground equality and disequality: S, T, F, I, Z, 
with S extended to apply to all function symbols of any arity. Suppose we are using 
something hke the recursive path ordering for proof terms. 

Refutation. If the inference rule F is the cheapest in the proof ordering, T < I, and 
nodes are measured by the values of i and j, then the canonical basis of any 
inconsistent presentation is a (smallest) trivial disequation {t ^t}. 

Deduction. If the proof ordering prefers direct application / of axioms over all other 
inferences (including Z), then trivial proofs are best. In that case, pA* = and the 
canonical basis includes the whole theory A^ = A*. 

Paramodulation. If the proof ordering makes functional reflexivity S smaller than 
/, but the only ordering on leaves is I{u,t) < I{c[u],c[t]) for any context c, then 
the canonical basis will be the congruence closure, as generated by paramodulation: 
pA = {f{ui, ...,Un) = f{ti, . . . , i„) : ui = ti, . . . , u„ = t„ e A*}. The theory 
A* is the closure under functional reflexivity of the basis A^. If A is as in our first 
example, then A^ = {2j = : j > 0}. 

Completion. On the other hand, if the ordering on leaves compares terms in some 
simplification ordering then the canonical basis will be the fully reduced set, as 
generated by (ground) completion: pA = {u ^ u] iJ {u ~ t: t — v ^ A* , t 
V, V is not u}. For our first example, A" = {2 = 0}. For another example, if 

A = {a — c, sa = b} and sa ^ sb ^ sc a b c, then I{sa, b) > 
T{S{I{a, c)), /(sc, 6)), and hence A« = {a = c, sc = b}. 
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Superposition. If one distinguishes between T steps based on the weight of the 
shared term j, making T > I when j is the smallest, and T < I otherwise, then the 
canonical basis is also closed under paramodulation into the larger side of equations. 

6 Derivations 

Theorem proving with simplification entails two processes: Expansion, whereby any 
sound deductions (anything in E*) may be added to the set of derived theorems; and 
Contraction, whereby any redundancies (anything in pE) may be removed. 

A sequence of presentations Eq ^ Ei ■ ■ ■ is called a derivation. Let E^, = 
UiEi. The result of the derivation is, as usual |Bachmair and Dershowitz, 1994| , its 
persisting formulce: 

Eao liminf_Ej 

We will say that a proof p persists when Fp C E^o- Thus, if a proof persists, so do its 
subproofs (by|5Jl. By ( fT^ . we have liEi □ Hi?*. 

Definition 6 A derivation Eq ^ Ei ^ ■ ■ ■ is good ifEi ^ for all i. 

We are only interested in good derivations. From here on in, only good derivations 
will be considered. It is easy to see that: 

Lemma 4 Derivations, the steps of which are expansions and contractions, are good. 
Proposition % If a derivation is good, then the limit supports the best proofs: E^ w 

Proof. One direction, namely liEoo 3 Hi?* , follows by ( fT6t from the fact that Eoo C 
E^. To establish that WE.^. □ Ili^oo, we show that pI\E.^. □ Ili^oo and rely on ( fT2t . 
Suppose p e ^UE^. It follows from ( 151^ that Fp C Ep C pRE^ C ^lUE^. By 
goodness, each a E Fp persists from some Ei on. Hence, Fp C Eoc, andp € Fli^oo- 

Definition 7 A good derivation is fair if C [Eoo] □ Flii^* where C{E) is the set of 
critical proof obligations.- 

C{E) := {peUE-.p^ ^inE*, yq<]p.qe /iHS*} (24) 

It is clean if pE^ n E^o ~ 0- 

Critical obligations are proofs that are not in normal form but all of whose proper 
subproofs are already in normal form. Fairness means that all persistent obligations are 
eventually "subsumed" by a strictly smaller proof. 

Lemma 5 If a derivation is clean, then its limit is reduced. 
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Proof. Suppose, on the contrary, that some r 6 pEoc C Eoo Q . Consider f, and 
compare it to a smaller proof p g ni?oo ■ Let A = T p <Z Eoo Q E^ . Let q E fiHE^, . 
Were ?- G T q, then replacing r as a subproof of q with p, would by Q result in a 
smaller proof than q. It follows that r G pE^, which contradicts cleanliness. 

Lemma 6 If a derivation is fair, then its limit is complete. 

Proof. Any presentation A is complete if IIA □ IIA n pJ\A* . since a E A* implies 
a S A {U.A n /xH^* ), whence completeness. Let A = i?* be aU formulae proved at any 
stage in the derivation. We show that A is complete in the above manner Completeness 
of Eoo follows from Lemma|3 Consider any proof in p G HA of a. Let p^o G IliJoo Q 
HA be the persisting proof of a, for which poo < p by the previous proposition. If 
Poo G /illA*, we're done. Otherwise, poo has a minimal (with respect to <) non- 
normal-form (possibly trivial) subproof q, all subproofs of which (persist and) are in 
normal form. By fairness, there is a proof r G HA of the same theorem as q such that 
Poo > q > r. By (0, there is therefore a better proof p' < poo < P- By induction, there 
is a p" < p' in both HA and /xIIA*, also proving a. 

For example, suppose a proof ordering makes c > - and - > a. Start with 
i5o = {c}, and consider c. Were c to persist, then by fairness a better proof would 
evolve, the better proof being ^. If a is in normal form, then a G Eoo and both minimal 
proofs persist. Another example: /iP = {a,c, - } and E = {a}, then E E ■ ■ ■ 
is fair, since Eoo = and C{Eoo) = 0- The result is complete but unsaturated (c is 
missing). 

Together, these lemmata and Proposition|6|yield: 

Theorem 3 If minimal proofs are unique and a derivation is fair and clean, then its 
limit is canonical. 

By \22\ . this also means that each e G Eoo is its own ultimate proof e, so is not 
susceptible to contraction. 

Returning to our main example, if projection P is the most expensive type of infer- 
ence, then no minimal proof includes it. And if proofs are compared in a simplification 
ordering (subproofs are always smaller than their superproofs), then minimal proofs 
will never have superfluous transitivity inferences of the form 



Let ^ be a total simplification-ordering of terms, let P > / > T > 5 > Z in the 
precedence, let proofs be greater than terms, and compare proof trees in the corre- 
sponding total recursive path simplification-ordering. Ground completion is an infer- 
ence mechanism consisting of the following inference rules: 

Deduce: E\J{w = t[u\} E\J{w = t[v\} \f u = v E E anAu^ v 

Delete: E\J{t^t] E 

Furthermore, operationally, completion implements these inferences "fairly": No per- 
sistently enabled inference rule is ignored forever. 
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Corollary 2 (Completeness of Completion) Ground completion results — at the 
limit — in the canonical, Church-Rosser basis: E^o = E^. 

Proof. Ground completion is good, since Deduce and Delete don't increase proofs 
(-^C^). In particular, /(w, t[u]) > T{I{w, t[v]), ^"(/(u, v))) if u > w, since t[u] > 
t[v] and t[u] ^ u ^ v. Ground completion is fair and clean. For example, the critical 
obligation 

w — t t = V ^ 



when t ^ w,v,\s resolved by Deduce. Also, since T > S, non-critical cases resolve 
naturally: 

W = t t = V W = t t = V 

fw = ft ft ~ fv > W = V 

fw = fv fw = fv 



7 Discussion 

We have suggested here that proof orderings, rather than formula orderings, take center 
stage in theorem proving with contraction (simplification and deletion of formulae). 
Given a proof ordering that distinguishes "good proofs" from "bad proofs", it makes 
sense to define completeness of a set of formulae as the claim that all theorems enjoy 
a smallest ("best") proof. Then an inference system is complete if it has the ability 
to generate all formulae needed for such ideal proofs. Given a formula ordering, one 
can, of course, choose to compare proofs by simply comparing the multiset of their 
assumptions.' 

The notion of "saturation" in theorem proving, in which superfluous deductions are 
not necessary for completeness, was suggested by Rusinowitch ri989' pp. 99-100] in 
the context of a Horn-clause resolution calculus. In our terminology: A presentation 
was said to be saturated when all inferrible formulae are subsumed by formulae in the 
presentation. This concept was refined by Bachmair and Ganzinger (see, most recently, 
^2001 1) and by Nieuwenhuis and Rubio 12001 pp. 29^2]. They define saturation in 
terms of a more general kind of redundancy: An inference is redundant if its conclusion 
can be inferred from smaller formulae; a presentation is saturated if every inference is 
redundant. 

We propose alternate definitions of saturation and redundancy, defining both 
in terms of the proof ordering. This appears to be more flexible, since it al- 
lows small proofs to use large assumptions. The definition of redundancy in 
INieuwenhuis a nd Rubi o, 200 1| coincides with ours when proofs are measured first 
by their maximal assumption. 

In IBachmairand D ers howitz, 1994| , a completion sequence is deemed fair if all 
persistent critical inferences are generated. In | |Nieuwenhuis and Rubio, 200 1| fn. 8], 

'a well-founded proof ordering on P can induce a well-founded foiTnula ordering on A: Define a ^ c 
if there exist p,q G fiP such that q\>p,Ap = a, and Aq = c. 
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an inference sequence is held to be fair if all persistent inferences are either generated or 
become redundant. The definition of fairness propounded here combines the two ideas. 
But fairness only earns completeness, not saturation. (A stronger version of fairness 
is needed for saturation when the proof ordering is partial.) Our definition of critical 
obligations also allows one to incorporate "critical pair criteria" |Kapure?a/., 1988) 
pjachmairand Dershowitz, 1988J . 
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